UNCUSSIFIO 


NL 


ff 


®W.flL£  CO pv 


APPLICATIONS  OF  SIGNAL  PROCESSING 


IN  DIGITAL  COMMUNICATIONS 


Principal  Investigator:  Michele  Elia 


Contractor:  Politecnico  di  Torino 
Corso  Duca  degli  Abruzzi  24  -  1-10129  TORINO  (Italy) 


Contract  number  DAJA45-86-C-0044 


Fourth  Interim  Report 
(November  1987  -  April  1988) 


DTTC 

Selected* 

JUN07B68S  i 


The  Research  reported  in  this  document  has  been  made  possible  through 
the  support  and  sponsorship  of  the  U.S.  Government  through  its  European 
Research  Office  of  the  U.S.  Army.  TCT^-TBpwr^,1,fS— 


"bisnuBimoN 

-Apptor«Ttoi  public  tofea-oj  j 

Distribution  Unli^u***^— - — - 


»8  6  6  175' 


Our  research  activity  during  the  period  covered  by  this  report 
was  focused  on  the  study  of  computational  complexity  related  to 
error  correcting  codes  and  more  in  general  to  arithmetics  over 
real  and  finite  fields. 

The  importance  of  computational  complexity  in  the  design  of  any 
coding  scheme  either  combined  or  not  with  modulations  need  not  be 
explained.  Our  aim  is  to  put  together  several  scattered  results 
and  gather  experience  in  manipulating  such  concepts  in  order  to 
exhibit  an  organic  view  of  computational  problems  that  are  pres¬ 
ent  in  the  design  of  encoder  decoder  and  demodulator  with  VLSI 
technologies. 

Two  papers  are  herewith  enclosed.  The  first  one  concerns  the 
complexity  of  power  computations  over  any  commutative  ring  with 
identity,  i.e.  finite  fields,  ring  of  matrices  etc. 

The  second  one  concerns  the  decoding  complexity  of  nonlinear 
codes  and  related  topics  connected  with  the  computation  of  Hadam- 
ard  Transforms,  vector  quantization  and  soft  decoding  of  block 
codes . 

The  paper  "A  Note  on  Addition  Chains  and  Some  Related  Conjectu¬ 
res"  will  be  presented  at  the  Advanced  International  Workshop  on 
SEQUENCES,  Combinatorics,  Compression,  Security  and  Transmission, 
to  be  held  in  Amalfi  Coast,  Salerno,  Italy,  June  1988. 

The  paper  "A  note  on  the  Complete  Decoding  of  Kerdock  codes"  will 
be  presented  at  IEEE  International  Symposium  on  Information  Theo¬ 
ry  to  be  held  in  Kobe,  JAPAN,  June  1988. 


REPORT  DOCUMENTATION  PAGE 


4.  TITLE  (md  Sa Mil)  . 

Applications  of  Signal  Processing 
in  Digital  Communications 


7.  AUTHOR!*; 

Ezio  Biglieri  and  M.  Elia 


%.  TYRE  OP  REPORT  *  PERIOD  COVERED 

I  Nov.  1987  -  Arril  1988; 
4th  Interim  Report 


6.  PERFORMING  ORC-  REPORT  NUMBER 


6.  CONTRACT  OR  GRANT  NUMBERS 

DAJA45-86-C-0044 


II.  CONTROLLING  OFFICE  NAME  ANO  AO  OR  ESS 

U.S. Array  Research,  Development  & 
Standardization  GRoup  -  UK 


12.  REPORT  OATE 


I 


April  30,  1988 


1).  NUMBER  OF  PAGES 

30 


.  MONITORING  AGENCY  NAME  *  AOOR CSV"  different  from  Controlling  Offlc e)  IS-  SECURITY  CLASS.  (of  « 

Unclassified 


17.  DISTRIBUTION  STATEMENT  (oi  the  ebetreet  entered  In  Sleek  30,  II  different  from  Report) 


It.  KE^ BOROS  (Caeltmie  on  revoree  elde  II  neeoeoery  end  Identity  kf  block  number) 

;  Digital  Communications,  Group  Codes ,i Computational  Complexity^. 


A  RETRACT  fCnd—»  w  warn  mtdm  H  PWfMa  ond  Idomtity  by  block  i—HQ 


^"We  consider  the  design  of  multidimensional  signal  sets  and 
their  combination  with  block  or  trellis  codes.  The  goal  is 
to  achieve  a  high  efficiency  in  the  use  of  f reguencvsspec- 
trum  for  digital  communications.^ 

r  r.  r  -  i 


i  jam  w  1473  eomoR  op  i  hov  «» i$  orsolete 


SECURITY  CL  ASS!  PIC  AT  ION  OF  THU  PACE  (Wttoo  Doto  Eolocod) 


A  Note  on  Addition  Chains  and  some  Related 
Conjectures  * 

M.  Elia  and  F.  Neri 
Dipartimento  di  Elettronica 
Politecnico  di  Torino  -  Italy 

March  7th,  1988 


Abstract 

Addition  chains  arc  finite  increasing  sequences  of  positive  integers, 
useful  for  the  efficient  evaluation  of  powers  over  rings.  Many  features 
of  the  addition  chains  are  discussed,  and  some  theorems  connected  to 
the  still  open  Schols-Brauer  conjecture  are  presented. 


1  Introduction 

In  many  fields,  such  as  computer  science,  applied  number  theory,  cryptog¬ 
raphy,  or  numerical  analysis,  an  efficient  computation  of 

Xn  =  XX  ...  X  (1) 

is  required,  where  n  is  a  positive  integer  (n  €  Z)  and  x  can  belong  to  any 
algebraic  system  R  (usually  a  ring)  in  which  an  associative  multiplication 
with  identity  is  defined.  This  implies  that  sums  and  products  can  take  a 
significantly  different  amount  of  time  when  applied  to  n  (i.e.  in  Z)  or  to  z 
(i.e.  in  ft). 

The  problems  typical  of  the  evaluation  of  powers  have  been  extensively 
discussed  by  Knuth  [1]  and  by  Borodin  and  Munro  [2],  Several  schemes  have 
been  proposed,  in  order  to  minimize  the  efforts  (i.e.  number  of  multiplica¬ 
tions)  for  evaluating  (1),  but  it  seems  that  none  can  be  definitely  preferable 
in  the  general  case.  The  choice  of  an  approach  instead  of  the  other  is  affected 
by  a  number  of  constraints,  aims  or  available  resources,  namely: 

’This  work  was  financially  supported  in  part  by  the  United  Statee  Army  through  ite 
European  Research  Office,  under  grant  n.  DAJA4S-86-C-O044. 
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•  the  order  of  magnitude  of  the  exponent  n; 

•  the  availability  of  storage  for  precomputed  tables; 

•  whether  the  situation  calls  for 

1.  independent  evaluations  of  the  power  (1) 

2.  evaluations  of  several  powers  of  the  same  base  z 

3.  evaluations  of  several  powers  to  the  same  exponent  n. 

As  an  example,  consider  the  generation  of  a  pseudo-random  sequence 
by  purely  multiplicative  congruentia!  methods,  i.e.  the  desired  sequence 
<  x„  >  is  obtained  by  the  iterative  relation 


zn+l  =  axn  mod  m.  (2) 

A  desirable  feature  for  relation  (2)  is  to  generate  a  sequence  with  maximum 
period;  if  m  is  a  prime  number,  this  is  achieved  whenever  the  multiplier  a 
is  a  primitive  element  in  the  finite  field  GF(m).  The  test  for  a  number  to 
be  primitive  can  be  a  quite  formidable  task,  however  it  can  be  obtained  by 
raising  the  number  being  tested  to  quantities  related  to  the  factors  of  m  -  1, 
as  shown  in  [3],  These  exponents  have  the  same  order  of  magnitude  of  m, 
hence  they  are  rather  sizable  for  non  trivial  periods  of  interesting  sequences 
<  x„  > ;  moreover  all  of  the  operations  must  be  done  fully  exploiting  finite 
size  registers  if  long  periodicity  is  desired  (see  [3]),  so  that  even  the  simple 
multiplication  can  be  fairly  costly.  In  this  situation  the  efficiency  in  the 
computation  of  (1)  is  crucial. 

2  Power  Evaluations 

At  a  first  sight  a  very  economical  evaluation  of  (1)  is  obtained  by  the  binary 
decomposition  of  the  exponent  n,  which  leads  to  a  number  of  multiplications 
upper  bounded  by  2  log]  n.  The  same  decomposition  implies  the  simple  but 
tight  lower  bound  flog]  n] .  Most  considerations  about  the  evaluation  of 
powers  concern  the  estimation  of  tighter  upper  bounds. 

If  we  write 


n  =  2>y,  fce{0,l},  (3) 

i=0 
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where  t  -  [log,  nj,  the  power  (1)  can  be  computed  as 

.=0 

Given  that  the  &,’s  can  be  only  0  or  1,  raising  to  4,  is  straightforward.  We 
shall  call  this  approach  right  to  left  binary  method. 

In  (4)  t  —  1  multiplications  are  required  to  evaluate  the  powers 

**'  (5) 

and  one  more  multiplication  is  needed  for  every  non  zero  4,,  leading  to  a 
total  of 

[log,  »J  -  1  -f-j/(n)  (6) 

multiplications,  where  i/(n)  is  the  number  of  l’s  in  the  binary  representation 
of  n.  The  storage  required  by  an  implementation  of  the  binary  method  (4) 
can  be  reduced  to  three  memory  cells:  one  to  hold  the  successive  powers  (5), 
another  to  hold  n  during  its  decomposition,  and  an  accumulator  for  the 
result. 

The  right  to  left  binary  method  can  be  generalised  to  an  m-ary  method 
in  the  following  way  [4].  Assume  the  task  to  be  the  computation  of  (l)  with 
the  constraint 

n  <  k,  k  fixed,  (7) 

and  let 

*  =  (8) 
The  exponent  n  can  be  decomposed  as 
i 

n  =  ^2dim\  di  €  (0,l,...,m- 1).  (9) 

i=0 

This  decomposition  can  be  rewritten  as 

n  =  ^  m’  +  2  m'  +  ...  +  (m  -  1)  m',  (10) 

•G^i  *€/*  i€Jm- 1 

where  J,  denotes  the  set  of  indices  such  that  the  coefficients  d,  in  (9)  are 
equal  to  j. 
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The  right  to  left  M-ary  method  can  be  described  by  the  following  pro¬ 
cedure. 

Step  1.  COMPUTE  AND  STORE  (11) 

zm,zm\zmS,...,zm';  (t  as  in  (8)) 

Step  2.  FOR  EVERY  j  6  1 ...  m  -  1 

COMPUTE  XX j  =  X  m 

Step  3.  COMPUTE  (1)  AS 

n 

Step  1  of  procedure  (11)  requires  at  most  t/(m)  multiplications,  if  1 (m)  is  the 
minimum  number  of  multiplications  for  raising  a  number  to  its  m-tb  power: 
actually,  in  the  average,  not  all  the  terms  in  Step  1  will  be  necessary.  Raising 
to  j  in  Step  2  requires  I(j)  multiplications,  while  the  remaining  operations 
in  Steps  2  and  3  can  be  carried  out  with  no  more  than  t  —  1  multiplications. 
The  total  number  of  multiplications  is  bounded  by 

m—  1 

tl(m) +  t  -  1  +  £)  l(j)  (t  as  in  (8)).  (12) 

Another  way  of  computing  (1)  is  to  rewrite  the  exponent  n  from  (3)  by 
Horner’s  rule  for  evaluating  polynomials 

n  =  bo  +  2(6i  +  2(6*  +  2(6j  +  (...  +  26() . . .))). 

We  shall  refer  to  this  approach  as  left  to  right  binary  method,  since  a  left  to 
right  scanning  of  n’s  binary  representation  is  required. 

The  left  to  right  binary  method  can  be  extended  to  an  m-ary  method, 
as  described  by  the  following  procedure. 

Step  1.  COMPUTE  AND  STORE  (13) 

*,  x* 

Step  2.  LET  •  =  t; 

START  WITH  z * ; 

Step  3.  REPEAT 

LET  i  =  i  -  I; 

RAISE  TO  THE  m-TH  POWER; 

IF  ii  IS  NOT  0 
MULTIPLY  BY  z*; 

UNTIL  l  =  0; 
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Table  1:  Number  of  multiplications  in  computing  (1). 


base 

m 

right  to  left 
procedure  (11) 

left  to  right 
procedure  (13) 

□ 

2  [log,  nj  -  1 

2  [log,  nj 

I 

3  [log,  nJ 

3  [logs  nJ  +  1 

HE 

3[log4nJ  +2 

3[log4  nj  +  2 

4[logs  nj  +  4 

4[log,  nj  +  3 

4[log6  nj  +  7 

4  [logg  nJ  +  4 

I 

5[log7  nj  +  10 

5[log7  nj  +5 

8 

4[logg  nj  +  14 

4[logs  nj  +  6 

Note  that  a  certain  amount  of  storage  is  necessary  for  the  quantities  com¬ 
puted  in  the  first  step  of  the  above  procedure;  moreover  the  representation 
base  m  of  n  must  be  available  in  a  left  to  right  order. 

Step  1  of  procedure  (13)  requires  at  most  m~2  multiplications;  actually 
the  xd'  do  not  need  to  be  computed  for  those  values  of  di  not  present  in 
the  decomposition  (9).  Each  iteration  of  Step  3  requires  at  most  f(m)  +  1 
multiplications,  if  l (m)  is  the  minimum  number  of  multiplications  for  raising 
a  number  to  its  m-th  power;  the  +1  is  present  only  if  the  t'-th  d,-  is  not  0. 
In  total  the  number  of  multiplications  is  bounded  by 

m  -  2  +  t  (l(m)  +  1)  (t  as  in  (8)).  (14) 

By  comparing  the  bounds  (14)  and  (12)  Table  1  can  be  built,  where  t  is 
expressed  as  in  (8).  The  order  of  magnitude  of  the  exponent  n  (given  by  k 
in  (7))  can  be  seen  to  affect  the  choice  of  the  base  m;  the  optimal  m  increases 
with  k.  Moreover,  those  bases  that  are  powers  of  2  appear  somehow  optimal, 
since  they  lead  to  comparatively  small  coefficients  for  [logm  nj  in  Table  1. 

Note  that  all  the  bounds  were  derived  neglecting  the  cost  of  decomposing 
the  exponent  n,  and  of  related  operations,  such  as  the  arrangement  in  (10) 
of  n’s  terms  in  the  sets  Jj.  This  can  be  an  acceptable  approximation  since 
the  operations  on  z  can  be  considered  to  be  much  more  expensive  than  the 
operations  on  n. 

Even  if  the  left  to  right  m-ary  method  seems  to  behave  better  for  large 
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bases  m,  a  careful  inspection  of  the  bounds  (14)  and  (12)  shows  that  the 
bound  (12)  is  weaker,  since  Steps  1  and  2  of  procedure  (11)  are  open  to 
several  optimizations  both  in  the  case  of  few  and  the  case  of  many  terms  in 
the  decomposition  (9) 

When  several  powers  of  the  same  base  z  are  to  be  done  the  precompu¬ 
tation  Step  1  can  be  executed  only  once  in  both  procedures  (13)  and  (11); 
in  this  case  the  right  to  left  method  becomes  extremely  advantageous,  since 
the  precomputation  in  (11)  is  much  heavier,  and  the  bound 

(m-l 

<  - 1  +  Xj  iu) 

can  be  obtained,  where  p  is  the  number  of  powers  to  be  computed. 


3  Asymptotical  Behavior:  a  Bound  for  l(n) 

In  this  Section  it  is  shown  that  both  the  upper  and  the  lower  bounds  pre¬ 
sented  in  the  previous  Section  are  asymptotically  (for  large  exponents  n) 
equivalent. 

bet  us  consider  the  m-ary  methods,  and  substitute 

<=  ll°8m  "J 

in  (12);  the  minimum  number  of  multiplications  for  raising  to  n,  l(n),  is 
bounded  by  the  number  of  multiplications  required  by  the  m-ary  method 
(which  is  not  optimal  in  general) 


log,  n  <  flog,  n]  <  l(n)  <  [logm  njf(m)  +  [log*,  nj  -  1  +  £  /(*');  (15) 

»'=j 

m  is  assumed  to  be  the  optimal  base  for  the  given  n.  In  the  rigthmost 
inequality  of  relation  (15),  the  [oj  function  can  be  neglected  for  large  n’s; 
hence,  after  a  base  change  in  the  logarithms,  it  can  be  rewritten  as 


and 


m— 1 

l(n)  In  m  <  (l(m)  -+l)lnn-lnro-)-lnm  Y"  1  (t) 

•=* 


Inm  l(n)  _ 1_.  _  lam  lnm  T-.1  l(i) 

f(m)  Inn  “  '  f(m)  lnnf(m)  Inn  “  f(m) 


(16) 
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Since  the  optimal  base  has  been  shown  to  increase  when  n  increases,  and  m 
in  (IS)  is  chosen  to  be  the  optimal  base,  all  the  terms  in  the  right  hand  part 
of  (16)  become  negligible  for  large  n,  and  we  get 


\  ton 
f(n)  «  — , 


k  = 


In  m 
l(m) 


The  limiting  values  of  the  constant  k  can  be  obtained  by  taking 


m  =  2*, 


for  which  we  have 


lnm  =  sin  2 


l(m)  =  « 

.  lnm  ,  „ 
k  ss  — — r  n  In  2 
/(m) 

and  the  final  result 

/(n)  *s  log,  n. 

Note  that  Knuth  [1,  page  451,  Theorem  D]  gives  a  totally  different  proof, 
due  to  Brauer  [5],  of  this  fact. 


4  Addition  Chains 

Addition  chains  are  the  proper  tool  for  solving  the  problem  of  computing  (l) 
for  a  given  n  with  the  minimum  number  of  multiplications,  i.e.  to 

find  l(n).  minimum  number  of  prodncta  for  evol-  . . 

noting  the  n-tk  power.  '  ' 

Note  that  this  problem  is  only  a  particular  case  of  problem  (1),  in  the  sense 
that  nothing  is  said  about  the  coat  of  deriving  /(n);  and  this  cost  can  exceed 
by  far  the  cost  of  computing  (1)  by  anyone  of  the  previously  quoted  methods. 
Hence  the  addition  chains’  approach  to  the  evaluation  of  powers  is  of  interest 
either  when  several  quantities  need  to  be  raised  to  a  same  fixed  exponent, 
or  in  the  case  where  enough  storage  is  available  to  store  the  precomputed 
addition  chains  for  all  the  possible  values  of  the  exponent  n. 

Let’s  see  the  formal  definition  of  addition  chains. 

An  addition  chain  for  n  is  a  sequence  of  integers 

l  =  ao<ai<o,  <...<0t  =  « 
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with  the  property  that,  for  every  t,  a  couple  (j,k)  can  be  found,  such  that 

a,  -  oy  +  ak,  i  >  j\  i  >  k.  (18) 

Without  loss  of  generality,  the  a,  ’s  are  assumed  to  be  sorted  in  ascending 
order,  and  with  no  duplications.  Let 

l(n )  =  minimum  r  for  which  there  exists  an  addition  (19) 
chain  of  length  r  for  n; 

then  this  addition  chain  is  a  solution  to  problem  (17). 

It  is  convenient  to  define  two  special  classes  of  addition  chains.  A  star 
chain  is  defined  as  in  (18)  with  the  stronger  constraint  j  =  «  -  1.  An  l°-chain 
is  an  addition  chain  with  some  marked  elements;  the  condition  is  that  in  (18) 
ay  is  the  largest  marked  element  leas  that  ay.  It  can  be  shown  that 

J(n)  <  I°(«)  <  !*(n),  (20) 

where  l°(n)  and  f*(n)  are  defined  as  in  (19),  respectively  for  !°-chains  and 
star  chains. 

A  lot  has  been  written  about  addition  chains  (see  (1]  for  a  presentation  of 
the  main  results),  but  the  problem  of  finding  f(n)  is  not  completely  settled, 
in  the  sense  that  f(n)  is  not  known  for  all  n’s. 

4.1  Functions  Related  To  Addition  Chains 

Two  interesting  functions  are  related  to  f(n)  in  (19);  they  are  defined  as 
follows. 


c(r)  =  minimum  integer  n  that  l(n)  =  r  (21) 

d(r)  =  number  of  solutions  in  n  to  the  equation  l(n)  =  r  (22) 

From  the  previous  sections,  the  function  /(n)  is  known  to  satisfy  the 
following  bounds 

[log,  nj  <  l(n )  <  [log,  nj  +  u{n)  -  1,  (23) 

where  «/(n)  is  defined  in  (8).  Since  i/(n)  <  [log,  n],  and 
[log,  nj  +  [log,  n]  <  2  [log,  nj  +  1, 
the  bounds  (23)  can  be  rewritten  as 

[logj  <  IH  <  2 [log,  nj.  (24) 
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For  a  generic  n,  for  which  l(n)  —  r,  the  following  bounds  hold 

c(r)  <  n  <  2r;  (25) 

the  lower  bound  is  straightforward  from  the  definition  (21)  of  e(r),  while  the 
upper  bound  derives  from  the  lower  bound  in  (23).  Substituting  n  =  c(r) 
in  (24),  we  can  write 

log,  c(r)  <  r  <  2  log,  c(r)  (26) 

and,  by  combining  the  upper  bounds  in  (25)  and  (26),  we  get 

2r/i  <  c(r)  <  2r.  (27) 

From  the  lower  bound  in  (27),  the  following  holds  for  the  function  d{ o) 
defined  in  (22) 

2r/,<X>(0- 

(=1 

From  (25),  and  from  the  definition  of  d(r),  the  following  inequality  can  be 
stated 

d(r)  <  2'  -  c(r)  +  1; 

hence 

d(r)  +  c(r)  <2'  +  l. 

Another  bound  on  c(r)  can  be  derived  in  the  following  way. 

£  (<**)  +  m)  <  E(2‘  +  1)  =  2r  -  2  +  r  -  1 
1=1  t=l 

r-1 

'£c(t)  +  2'li<2'  +  r-3 
1=1 

r-1  r-1 

J22*'7  ^  ^  2r  -  2r/J  +  r  -  3 

1=  l  <=i 

r— 1 

(2l/2  +  l)(2r/2  -  1)  -  1  <  £  e(0  <  2r  -  2r/2  +  r  -  3. 

1  =  1 

Asymptotically: 

2*r+,s  <  £c(t)  <  (2r/2  -  |V  +  r. 

1=1  '  Z'' 
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Table  2: 


rr. 

IBS 

KOI 

ESI 

E3S1 

Se(r) 

2' 

1.41 

2 

1 

2 

3 

3 

2.82 

5 

6 

10 

8 

4 

7 

11 

17 

16 

5 

5.65 

11 

20 

28 

32 

6 

8 

15 

19 

35 

47 

64 

7 

11.3 

26 

29 

61 

76 

128 

8 

16 

44 

47 

105 

123 

256 

9 

22.6 

78 

71 

183 

194 

512 

10 

32 

136 

127 

319 

321 

1024 

11 

45.2 

246 

191 

565 

512 

2048 

12 

64 

432 

397 

997 

909 

4096 

13 

90.5 

772 

607 

1769 

1516 

8192 

14 

128 

1087 

2603 

16384 

15 

181.0 

1903 

4506 

32768 

From  this  and  the  previous  relations,  a  likely  conjecture  is  that  the  func¬ 
tions  c(r)  and  d(r)  behave  asymptotically  as  the  r-th  power  of  2: 

d(r)  =  O  {2d’r) 

c(r)  =  O  (2c'r) 

where  dr  and  cr  are  constants  between  0.5  and  1. 

The  values  of  c(r)  and  d(r)  for  some  small  values  of  r  are  shown  in 
Table  2.  The  columns  labeled  as  S<|(r)  and  Sc(r)  show  the  quantities 

$«(••)  = 

t= 1 

s.W=tcW- 

t=l 
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5  The  Scholz-Brauer  Conjecture 


A  famous  problem  concerning  addition  chains  is  the  Scholz-Brauer  conjec¬ 
ture  [6],  referring  to  chains  for  2"  -  1,  which  are  of  special  interest,  since 
they  are  the  worst  case  for  the  binary  method  (their  binary  representation  is 
a  stream  of  consecutive  l’s).  Let  us  call  a  number  n  satisfying  the  inequality 

<( 2"  -  1)  <  n  -  1  +  f(n),  (28) 

where  l(n)  is  defined  in  (19),  a  SB-number.  The  longstanding  Scholz-Brauer 
conjecture  states  that 

all  positive  integers  are  SB-munbers. 

In  the  following,  it  will  be  shown  that  (28)  holds  for  infinitely  many  n’s. 
Let  us  recall  some  of  the  properties  of  l(n),  reported  from  [1];  they  will  be 
useful  in  the  sequel. 

f(nm)  <  !(n)  +  l(m);  (29) 

1(2«)  =  a; 

l(2“  +  2*)  =  a+l  if  a  >  6  >  0;  (30) 

1(2“  +•  2k  +  2“)  =  a  +  2  ifn>fc>c>0 

(this  is  Theorem  B  in  (lj); 

a  +  2  <  1(2“  -f  2*  -b  2e  +  2*)  <  a  +  3  if  a  >  k  >  c  >  d  >  0, 

where  n  =  2'*  +  2k  +  2'  +  2'iia  said  to  be  special  (see  (1,  p.449()  if  the  tower 
bound  holds  with  equality  (this  is  called  Theorem  C  in  jl]); 

1°(2"  —  1)  <  r»  -  1  +  l°(n);  (31) 

this  implies  that  the  Shols-Brauer  conjecture  holds  for  1° -chains  (the  result, 
due  to  Hansen,  is  called  Theorem  G  in  jlj). 

Lemma  1  For  everf  integer  a,  the  following  inequality  hold* 

1(2**  -  1)  <  2“  -  1  +  o.  (32) 
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Proof  -  It  is  direct  that  (32)  holds  for  a  =  0.  Now  let  us  suppose  (32)  to 
be  satisfied  for  a  —  1;  thus,  using  (M)  and  (30),  we  have 

L{f'  -  1)  =  /  ((2l*  ‘  -  1)(2J*~‘  +  1))  < 

</(2,“‘  -1)+1(2J*'‘  +  1)  < 

<  2“_1  -  l  +  a  -  1  +  2a_1  +  1  < 

<  2“  -  1  +  a. 

The  validity  of  (32)  for  every  a  follows  from  the  induction  principle. 

□ 

Note  that  the  recursive  argument  used  in  the  proof  above  also  defines 
an  addition  chain  which  contains  numbers  of  the  form 

2*  (2,k  —  1)  0<Jfc<2\  l  <  h  <  a  ~  l.  (33) 

For  later  use,  we  state  this  point  as  a  Corollary. 

Corollary  1  There  exists  an  addition  chain  for  2J*  -  1  of  length  2a  -  1  +  a, 
such  that  it  contains  the  numbers  (39).  This  addition  chain  has  the  form 

■  ■  • ,  (2,k  -  l),2(2*k  -  1), . . .  ,2*k  (2*‘  -  1),  (2l‘+1  -  1), . . . 

Note  that 

2J‘(2,k  -  1)  +  (2*k  -  1)  =  2*‘+‘  -  22“  +  2*‘  -  1  =  2,k+*  -  1. 


Theorem  X  For  everg  positive  integer  n  the  inequality 

1(2"  -  1)  <  n  -  2  +  f(n)  +  [logj  nj  (34) 


holds. 

Proof  -  By  decomposing  n  into  its  binary  representation  as  in  (3),  we  can 
write 

2"  -  1  =  •<*’(2*,*#  -  1)  +  2^»=® *■*' (2*'~ 1  -  1)  +  . . .  +  (2*°  -  X)  = 

=  £2£&‘<*‘(2*/*’-l).  (35) 

>=o 
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Applying  Corollary  1  it  can  be  seen  that  all  the  i/(n)  terms  in  the  summa¬ 
tion  but  the  first  are  in  the  chain  for  (2*  -  1),  whose  length,  according  to 
Lemma  1,  is  bounded  by  2*  -  1  + 1.  Since  the  first  factor  in  the  first  term 
can  be  expressed  as  2n~1',  it  accounts  for  at  most  n  -  2*  multiplications. 
Combining  these  two  contributions  with  the  i/(n)  -  1  additional  multipli¬ 
cations  required  by  the  K(n)  not  zero  terms  in  the  decomposition  (35),  the 
Theorem  is  proved. 

□ 

Lemma  2  //f(n)  =  i*(n)  then  n  it  a  SB-numter. 

Proof  -  Straightforward  from  (20)  and  (31). 

Lemma  3  If  l(n)  =  [logj  nj  +  f(n)  —  1  then  n  it  a  SB-number. 

Theorem  2  Every  n  such  that  i/(n)  it  not  greater  than  4  it  a  SB-namber. 

Proof  -  The  proof  of  Theorem  2  is  given  separately  for  the  four  cases 
i/(n)  =  1 . 4. 

Case  i/(n)  =  1  -  Proved  in  Lemma  1. 

Case  i/(n)  —  2  -  It  must  be  shown  that,  for  every  integer  a  and  &  such  that 
a  >  b  >  0,  the  following  inequality  holds 

f(2**+,‘  -  1)  <  2*  +  2‘  +  a. 

We  can  write 

2s‘+3‘-l  =  2J*(2,‘-l)  +  2,‘-l. 

Prom  Corollary  1  we  know  that  2*'  -  1  belongs  to  the  addition  chain 
ending  in  2**  -  1,  so  that,  using  Lemma  1  we  have 

i(2J*+*‘  -  1)  <  /(2s*  -  1)  +  2*  +  1  <  2°  +  2*  +  a. 

a 

Case  i/(n)  =  3  -  It  must  be  shown  that,  for  every  a,  b  and  c  such  that 
a  >  b  >  c  >  0,  the  following  inequality  holds 

l(2**+**+r  —  i)<2*  +  2*+2e  +  a  +  l. 


In  a  way  similar  to  the  case  i>(n)  =  2,  using  Corollary  1  and  Lemma  1, 
the  proof  stems  from  the  equality 

2i*+*‘+1*  -  I  =  21‘+,‘(2l*  -  1)  +  2J‘(2J‘  -  1)  +  22'  -  1. 


Case  u(n)  =  4  -  Two  subcases  must  be  considered:  /(n)  =  a+3  and  l(n)  = 
a +  2.  In  the  first  case  the  proof  follows  from  Theorem  1.  In  the  second 
case  it  follows  from  Exercise  13  in  [1,  p.  463]  —  showing  that  n  has  a 
star  chain  so  that  Lemma  2  applies  —  and  (31).  □ 


6  Conclusions 

Knuth  reports  that  1  <  n  <  18  and  sporadic  20,  24  and  32  are  SB-numbers. 
We  have  shown,  as  a  consequence  of  Theorem  2,  that  all  1  <  n  <  30  are 
SB-numbers.  For  n  =  31  Lemma  1  gives 

31  <  /(2s1  -  1)  <  31  +  7, 

being  the  upper  bound  slightly  greater  than  31  +  6,  which  comes  from  (28). 
Therefore  31  is  the  smallest  integer  that  might  not  be  a  SB-number. 

For  numbers  of  the  form  2”  - 1,  since  logj  n  >  v  (n)  - 1 ,  the  inequality  (34) 
can  be  rewritten  as 

1(2"  -  1)  <  n  -  1  +  c  log,  n,  (36) 

where  c  is  a  convenient  constant  1  <  c  <  2.  This  result  implies  that  the 
numbers  of  the  form  2n  —  1  behave  better  than  Erdos’  probabilistic  bound  [7] 

=  log,  n  +  °  ■  (37) 

In  fact,  the  second  term  at  the  right  hand  side  of  (37),  in  this  case,  has  the 
form 

log,(r  -  1)  ^  n 

log,  log,  (2"  -  1)  log,  n 

and,  for  large  n’s 

.  n 
e  log,  n  <  r— — 
log,  n 

As  a  consequence  of  the  results  presented  in  this  paper,  an  interesting 
open  question  is  to  find  the  smallest  value  of  c  such  that  (96)  holds  for  every 
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Abstract 

A  definition  of  the  Kerdock  code  K{m)  ia  given  that  allows  instan¬ 
taneous  encoding  and  the  application  of  different  complete  decoding 
strategies.  The  particularly  interesting  code  K( 4)  is  thoronghly  de¬ 
scribed,  with  applications  to  error  correction  and  to  vector  quantisa¬ 
tion.  In  both  cases  the  complexity  in  terms  of  number  of  arithmetical 
and  logical  operations  is  discussed.  Finally  the  bit  error  rate  for  K  (4) 
on  the  binary  symmetric  channel  is  found  in  closed  form. 

1  Introduction 

Kerdock  codes  K  (m)  are  nonlinear  codes  having  many  interesting  proper¬ 
ties,  such  as  high  error  correcting  capabilities,  high  symmetry  and  beautiful 
descriptions,  combined  with  low  rates  for  great  m’s.  They  may  be  viewed  in 
some  way  as  dual  codes  of  Preparata  codes  P(m),  smother  noteworthy  class 
of  nonlinear  codes.  The  code  K(4)  is  very  interesting  because  besides  the 
relatively  high  rate  1/2,  it  coincides  with  the  Preparata  code  P( 4),  so  that 
it  looks  like  a  sort  of  self-dual  nonlinear  code. 

The  most  obvious  application  of  Kerdock  codes  is  their  use  as  chan¬ 
nel  codes  in  communication  systems.  K  (4)  may  also  be  viewed  as  the 
Nordstrom- Robinson  code  Mu,  and  used  as  a  vector  quantizer  for  encoding 
random  waveforms  such  as  in  the  case  of  speech  Linear  Predictive  Coding 

'This  paper  will  be  presented  at  IEEE  International  Symposium  on  Information  The¬ 
ory,  Kobe,  JAPAN,  June  1988. 

’This  work  was  financially  supported  in  part  by  the  United  States  Army  through  its 
European  Research  Office,  under  grant  n.  DAJA4S-86-C-0044. 
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Table  1:  Weight  distribution  for  K(m). 


(LPC)  at  the  rate  of  1/2  bit  per  sample  [3j.  In  a  similar  way  Kerdock  codes 
allow  the  decoding  from  data  produced  by  soft  demodulation.  In  both  vector 
quantization  and  soft  data  decoding  the  problem  is  to  minimize  an  objective 
function,  which  most  frequently  is  taken  to  be  the  squared-error  distortion. 

In  Section  2  a  definition  of  K  (m)  as  systematic  code  is  given  that  allows 
instantaneous  encoding  and  the  application  of  different  strategies  for  a  com¬ 
plete  decoding,  as  it  will  be  described  in  Section  3.  The  application  of  K  (4) 
to  vector  quantisation  will  also  be  described  in  Section  3.  A  short  analy¬ 
sis  of  the  computational  complexity  pertaining  to  the  above  applications  of 
K  (4)  will  be  given  in  Section  4.  Finally,  Section  5  reports  some  results  on 
the  performance  evaluation  of  K  (4)  used  as  a  channel  code  on  the  Binary 
Symmetric  Channel  (BSC). 


2  Encoding 

In  this  Section  we  briefly  recall  the  formal  definition  of  Kerdock  codes  in 
order  to  introduce  a  systematic  encoding  scheme.  We  also  collect  some  of 
its  general  properties  for  easy  reference. 

The  Kerdock  code  K(m),  m  even,  is  a  nonlinear  code  consisting  of  the 
Reed-Muller  code  of  parameters  (2m,  m  +  1, 2m/1)  and  2m_l  -  1  cosets  of 
*(l,m)  in  *(2,m).  Usually  K(m)  is  denoted  by  (2m,2*",2m-1  -  2m/,-,l. 
Important  features  of  any  code  are  the  weight  and  the  distance  distributions. 
The  weight  distribution  of  a  [n,  M,d]  code  is  the  set  {Ai}"_0,  where  A, 
denotes  the  number  of  codewords  of  weight  i,  while  the  distance  distribution 
is  the  set  (Bj  }"_<),  where  M  B,  is  the  number  of  ordered  pairs  of  codewords 
such  that  the  distance  between  them  is  «.  Linear  codes  have  B<  =  A,-  and 
the  same  property  is  shown  by  Kerdock  codes.  The  weight  and  distance 
distribution  of  £(m),  taken  from  [1],  is  given  in  Table  1. 

Let  eg  be  a  vector  of  J?(l, m).  For  later  use  it  is  convenient  to  interpret 
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eg  according  to  the  following  decomposition 

eg  =  {  *i  I  °  I  4  )•  (1) 

« —  m  +  1  — *  * —  m  —  1  — ►  « —  2"*  -  2m  — * 

Let  Wi  be  a  coeet  leader  that  performs  a  translation  of  £(l,m)  to  generate 
a  codeword  e  of  <  (m),  i.e. 

e  =  »j  +  ejt; 

the  code  < (m)  can  be  written  as  the  union  of  disjoint  cosets  of  £(l,m)  as 
follows 

K(m)  =  [«i  +  J2(l,m)j  U  [at]  +  R(l,  m)j  U  . . .  U  [wj-  +  £(l,m)].  (2) 

The  definition  of  K (m)  strongly  lies  on  the  choice  of  the  «\’8,  i  =  1, . . .  ,2m, 
which  may  be  obtained  by  means  of  primitive  idempotents  for  length 
2m_1  —  1,  or  by  using  simplectic  forms  to  define  a  convenient  set  of  boolean 
functions.  A  very  simple  construction  of  K  (4)  is  given  in  [2]  were  the  cosets 
leaders  are  defined  through  simplectic  forms,  very  easy  to  obtain,  on  four 
variables. 

For  easy  reference,  it  is  convenient  to  introduce  a  binary  matrix  built 

with  the  coeet  leaders  written  by  rows,  an  example  of  which  is  given 
by  (5). 

Now  we  use  a  systematic  JE(l,m)  code  and  its  translates,  given  by  coset 
leaders  Wi  of  a  special  form,  to  generate  K(m). 

Let  «'  =  (  «i  |  i]  )  be  a  vector  of  2m  information  bits.  As 

« —  m  +  1  — ►  < —  m  -  1  — » 

a  consequence  of  next  Lemmas  1  and  2,  w%  may  be  taken  of  the  form 

=  (®  i  **  i  <o 

and,  referring  to  equation  (1),  we  set  *1  =  »i,  so  that  the  codewords  will 
result  of  the  form 

e  =  (  *1  I  #  +  »J  I  *  +  «*  )•  (3) 

♦—  m  +  1  — *  « —  m  -  1  — *  «—  tm  -  2m  — f 

The  general  properties  that  have  been  used  to  define  K  (m)  are  formally 
stated  in  the  next  two  Lemmas. 

Lemma  1  In  every  covet  of  a  systematic  linear  (n,t,d)  code  there  exists 
exactly  one  word  with  k  contecntivc  seres  in  information  positions. 
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Lemma  2  In  the  matrix  there  exists  a  submatrix  made  of  m  -  1 

columns  whose  rows  are  the  2m_1  different  binary  sequences  of  m  -  1  bits. 

The  proof  of  these  two  Lemmas  is  easy  to  obtain,  and  it  will  be  omitted 
here. 

From  the  form  (3)  it  is  seen  that  instantaneous  encoding  is  possible.  In 
fact  the  first  m  +  1  bits  can  be  transmitted  while  they  enter  the  encoder. 
At  the  (m  +  l)-th  bit  the  remaining  parity  check  bits  for  the  £(1,  m)  code, 
i.e.  the  vectors  a  and  6,  can  be  computed.  As  the  remaining  m  -  1  infor¬ 
mation  bits  enter  the  encoder,  they  are  summed  with  the  entries  of  vector 
a  and  transmitted.  At  that  point  the  coset  leader  (hence  the  vector  d) 
is  known,  so  that  the  remaining  parity  check  bits  can  be  computed  as  b  +  d 
and  transmitted. 

The  above  results  applied  to  K(4)  let  the  generating  matrix  of  the  un¬ 
derlying  £(1,4)  code  be  written  in  the  form 

/  10000  111  01101001  \ 

01000  110  11010101 

GX  =  (G,  |  Gj  |  Gs)  =  00100  101  10110011  ;  (4) 

00010  011  10001111 

V  00001  000  01U1111 

and  correspondent^  the  matrix  VVW  of  coset  leaders  in  the  form 

00000  000  00000000  \ 

00000  001  11100101 

00000  010  10111001 

00000  100  11001011  ,-v 

00000  011  01010011  y  ’ 

00000  101  00011101 

ooooo  no  ooiooin 

00000  111  11111110 

It  must  be  noted  that  the  2m  information  positions  in  the  code  vectors  e 
defined  by  equation  (3)  contain  all  the  possible  2*m  binary  sequences  of  such 
length,  therefore  the  Kerdock  code  could  be  viewed  as  a  strictly  systematic 
code  at  the  cost  of  loosing  the  orderly  definition  reported  above. 
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3  Decoding  and  Quantization 

In  this  Section  some  procedures  for  decoding  Kerdock  codes  and  some  pro¬ 
cedures  for  performing  the  vector  quantization  based  on  Kerdock  codes  are 
described.  As  a  consequence  of  the  definition  given  in  Section  2,  the  problem 
of  decoding  Kerdock  codes  may  be  formulated  as  follows: 

given  a  received  word  r,  find  the  pair  [$>it  e%)  made  of  a  coeet 
leader  and  a  Reed-Mullcr  codeword,  inch  that  the  decoded 
codeword  e  —  A; +  2*  satisfies  the  chosen  decoding  criterion. 

Several  decision  rules  may  be  considered,  their  main  difference  lying  in 
the  manner  adopted  to  resolve  ties  whenever  more  than  [^J  errors  are 
detected,  since  these  codes  are  not  perfect.  In  particular  two  strategies 
deserve  special  interest:  the  Maximum  Likelihood  (ML)  and  the  Minimum 
Correction  (MC)  rules.  They  are  defined  as  follows. 

Maximum  Likelihood  rule:  r  is  decoded  as  the  codeword  2  that  max¬ 
imizes  the  conditional  probability  p{e  |  r}.  On  BSC  this  rule  coincides 
with  the  minimum  distance  decoding,  i.e.  r  is  decoded  as  the  codeword  2 
corresponding  to  the  minimum  distance  d(e,r). 

Minimum  Correction  rule:  r  is  decoded  as  the  codeword  at  the  min¬ 
imum  distance  if  the  distance  is  less  or  equal  to  [^J;  otherwise  the  in¬ 
formation  bits  are  extracted  from  the  received  word  without  any  correction 
attempt. 

Four  decoding  algorithms  will  be  now  described,  based  on  the  arithmetic 
in  GF(2).  Let  us  remind  that  the  Hamming  weight  wt(z)  of  a  vector  z  is 
the  number  of  its  nonzero  components  and  let  us  introduce  the  vector  r, 
decomposed  as  in  (1) 

r  =  (r,  |  r,  |  rs), 
that  will  be  referred  to  as  the  received  vector. 

Algorithm  1  (Minimum  distance). 

Store  Cj  in  a  table  T . 

•  Input  r. 

•  Compute  the  2,m  Hamming  distances 

Vi  =  wt(r  -  e<),  i  =  l,...,2*m. 

•  Find  the  minimum  y*.  Ties  are  resolved  by  random  equiprobable 
choices. 
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1 

u 

0 

1 

1 

16 

2 

120 

3 

112 

4 

7 

Table  2:  Weight  distribution  of  ML  correctable  error  patterns  for  K(m). 

•  Decode  r  as  the  codeword 

•  Recover  t  from 


□ 

The  next  algorithm  is  restricted  to  £(4),  as  it  takes  advantage  from  the 
fact  that  Standard  Array  decoding  is  possible.  In  fact  by  computer  search 
256  correctable  error  patterns  have  been  found  such  that  the  translates 
li  +  K( 4)  do  not  overlap  and  cover  the  whole  space  GF(2)16.  The  weight 
distribution  {L$}*“0  the  correctable  error  patterns  is  reported  in  Table  2, 
where  L,  denotes  the  number  of  error  patterns  of  weight  i.  From  this  Table, 
the  fact  that  K  (4)  is  not  quasi-perfect  can  be  observed. 

Algorithm  2  (Syndrome  decoding  -  ML  rule). 

Store  Hg,  the  parity  check  matrix  of  £(1,4). 

Store  the  vectors  *,■  =  HgW j,  i  =  1, ...  ,2s. 

Store  the  vectors  u;  =  Hglj,  j  =  1,. . .  ,2®. 

•  Input  r. 

•  Compute  the  syndrome  s  =  Hgr. 

•  Find  the  pair  (&;,£«]  that  sums  to  s. 

•  Recover  <,  from  u; . 

•  Decode  r  as  e  =  r  -f 

•  Recover  «  from  i. 

□ 

There  are  good  reasons  to  conjecture  that  this  decoding  algorithm  may  also 
be  applied  to  decode  K(m),  for  every  m. 

Algorithm  2  can  be  adapted  to  the  MC  decoding  rule  as  follows. 
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Algorithm  3  (Syndrome  decoding  -  MC  rule). 

Store  a  table  T  of  8  x  137  syndrome  vectors  associated  to  error  pattern 
of  weight  not  greater  than  2. 

Store  Hz,  the  parity  check  matrix  of  £(1,4). 

Store  Gz,  the  generating  matrix  of  £(1,4). 

•  Input  r. 

•  Compute  the  syndrome  s  =  /fgr. 

•  Search  for  the  error  pattern  e  in  T,  using  the  entry  t. 

•  IF  successful 

THEN  decode  r  as  e  =  r  +  i 
recover  s'  from  e 

ELSE  take  the  first  m  4  1  information  bits  unmodified:  tj  =  rj, 
compute  a  =  Gj  s'i 

and  take  the  remaining  m —  1  information  bits  as  s’j  =  rj  +  a. 

□ 

Algorithm  4  (Tabular  Decoding). 

Store  a  Table  Ti  of  the  indices  j’s  associated  to  the  error  patters  l;  ’s, 
for  every  r  €  GF(2)W. 

Store  a  Table  Tj  of  the  error  patterns  tj'*,  j  ~  1, . . . ,  2®. 

•  Input  r. 

•  Get  j  from  r  using  Table  Ti . 

•  Get  tj  from  j  using  Table  Tj. 

•  Compute  i  =  r  +  tj. 

•  Recover  s'  from  e. 

□ 

Vector  quantization  is  a  field  where  AC(4)  has  found  a  valuable  applicar 
tion.  Let  us  formally  recall  the  vector  quantization  problem  with  minimum 
squared -error  distortion.  Let  *  =  (xi,...,xn)  e  B"  be  the  input  to  the  vec¬ 
tor  quantizer  and  let  {e<}£Li  be  the  set  of  codewords.  The  problem  may  be 
formulated  as  follows: 


7 


find  the  codeword  c,  among  the  N  possible  ones  which  min¬ 
imizes  the  squared  error 

II  *  -  ||1=  *Tz  -  2z TCi  +  cf  ej,  (6) 

where  if  cj  e,  is  independent  oft  then  the  minimum  distance 
is  achieved  by  the  codeword  e,  yielding  the  largest  scalar  prod¬ 
uct  yj  =  *TCj. 

The  most  efficient  algorithms  known  today  for  performing  the  vector 
quantization  using  K.  (4)  are  based  on  the  Hadamard  Transform  (HT),  whose 
definition,  for  easy  reference,  will  now  be  recalled. 

Let  H„  denote  an  Hadamard  matrix,  which  isanxn  matrix  of  +  l’s 
and  -l’s  with  the  property  that  the  scalar  product  of  any  two  distinct  rows 
is  0.  Thus  H„  must  satisfy  the  relation 

HnHl  =  nl, 

where  I  is  the  n  x  n  identity  matrix. 

An  n-dimensional  column  vector  y  is  called  the  HT  of  the  vector  z  if  it 
is  obtained  multiplying  the  vector  z  by  an  Hadamard  matrix,  i.e. 

V  =  #«*• 

In  this  context  we  shall  consider  the  «-th  binary  codeword  of  K (m)  as 
a  vector  of  -f-l’s  and  -l’s,  with  +l’s  replacing  0’s  and  -l’s  replacing  l’s, 
this  will  replace  the  usual  vector  sum  over  GF(2)  with  the  Hadamard  dot 
component- wise  product  of  real  vectors,  hereafter  denoted  Q.  The  scalar  and 
Hadamard  products  are  compatible  in  the  sense  that  the  following  property 
holds 

zT(y©z)  =  (z  ©  y)r*.  (7) 

Vector  quantization  with  the  K (4)  code,  requires  the  computation  of  256 
scalar  products 

y i  =  zTCi,  i=l,...,  256,  (8) 

and  256  comparisons  to  search  the  minimum  &.  Applying  the  property  (7), 
y,  may  be  computed  as 

Vi  =  zr(«r,0eR)  =  (zQWj)T  c*.  (9) 

As  noted  in  (1,2,3],  the  32  codewords  of  £(1,4)  can  be  grouped  to  form 
a  Hadamard  matrix  H i#  and  its  negative  -Hie.  Therefore  the  y,’s  can 
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be  computed  as  HT’a  of  the  8  vectors  (z  ©  toy).  Moreover  only  128  scalar 
products  are  to  be  computed  because  K (4)  in  the  ±1  representation  contains 
both  Ci  and  -c<.  Finally  only  128  comparisons  are  necessary  to  find  the 
maximum  scalar  product;  the  search  can  be  limited  to  the  absolute  values 
(  xTCi  \  and  the  proper  codeword  can  then  be  chosen  according  to  the  sign 
of  ZTCj. 

The  above  observations  can  be  also  applied  to  the  minimum  distance  de¬ 
coding  of  soft  data.  The  computation  of  the  Algorithm  1  may  be  performed 
by  executing  the  HT’s  of  the  received  vector  r  and  the  seven  companion  vec¬ 
tors  r  ©  Wj,  j  =  2, . . .  ,8.  In  the  following,  two  algorithms  that  implement 
the  decoding  along  these  lines  are  described. 

Algorithm  5  (HT  -  ML  rule). 

Store  WW. 

Store  Hls. 

•  Input  r. 

•  Compute  the  28  scalar  products  y,-  =  rTCi,  i  —  1, . . . ,  2s,  by  performing 
the  16  HT’s 

HwirOWi)  *  =  2,..., 8 

-ffi«(r©w,)  t  =  2,...,  8. 

•  Find  the  maximum  y<.  Ties  are  resolved  by  random  equiprobable 
choices. 

•  Decode  r  as  the  codeword  cV 

•  Recover  t  from  c,-. 

□ 

Algorithm  6  (HT  -  MC  rule). 

Store  WW. 

Store  Hiq. 

Store  Gjj. 

•  Input  r. 
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•  Compute  the  2s  scalar  products  y,  =  rTe,-,  i  —  1, . . . ,  2®,  by  performing 
the  16  HT’s 

Hie(rO»>)  i=l,...,  8 

-Hi6(rQWi)  «'  =  1, ...  ,8. 

•  Find  the  maximum  ft. 

•  IF  no  ties 

THEN  decode  r  as  the  codeword  ei 
recover  »  from  e< 

ELSE  take  the  first  m  +  1  information  bits  unmodified:  t'i  =  n, 
compute  a  =  Gj  ii 

and  take  the  remaining  m- 1  information  bits  as  »j  =  rz  +  a. 

□ 

4  Computational  Complexity 

Every  dissertation  on  decoding  complexity  suffers  the  lacking  of  suitable 
measures  of  complexity.  However  for  most  practical  applications  the  number 
of  arithmetical  operations  (in  any  field),  the  number  of  logical  operations 
and  the  amount  of  storage  required  can  be  taken  a a  meaningful  figures.  In 
the  following  the  complexity  for  decoding  K  (4)  with  both  the  ML  and  the 
MC  rules  is  estimated.  The  complexity  of  vector  quantization  using  K  (4)  is 
also  analyzed  and  a  significant  improvement  over  [3]  is  obtained. 

Decoding  .  The  complexity  for  decoding  Algorithm  2  of  the  previous  Sec¬ 
tion  is  briefly  discussed  in  the  following.  Similar  considerations  can 
be  applied  to  all  the  algorithms  presented.  Complexity  results  are 
summarized  in  Table  3. 

Complexity  of  Algorithm  B.  We  have  2U  syndromes  of  11  bits  which 
correspond  to  the  2s  x  2®  pairs  (to , •,/,);  they  can  be  precomputed  as 

HtWi  +  HtBi  i  —  1, ...  ,8;  y  =  1, . . . ,  256.  (10) 

The  decoding  operations  require  the  computation  of  s,  a  table  look  up 
for  finding  and  tj ,  and  finally  the  computation  of  r  +  /,  to  obtain 
the  correct  information  bits.  The  complexity  results  in 

•  the  storing  of  Hz,  requiring  11  x  16  bits; 
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•  the  storing  of  2U  x  8  bits,  i.e.  the  reference  to  the  suitable  tj  for 
every  s; 

•  the  storing  of  28  x  16  bits,  i.e.  the  /,•’ s; 

•  15  x  11  sums  in  GF(2)  for  the  evaluation  of  s  =  Hgr; 

•  a  direct  access  search  in  a  table  of  211  entries  to  get  j  and  then 
in  a  second  table  of  28  entries  to  get  ; 

•  the  computation  of  the  correct  information  bits  requiring  either 

-  the  storing  of  the  Wj’s,  requiring  8  x  16  bits,  and  the  compu¬ 
tation  of  5  sums  in  GF(2)  to  find  the  »j  bits; 

-  the  storing  of  the  tj  bits  for  every  codeword,  requiring  256  x  3 
bits,  and  a  direct  table  access. 

This  complexity  is  the  same  for  ML  and  MC  rules,  the  only  difference 
being  different  choices  of  the  correctable  error  patterns. 

Operating  in  real  fields  according  to  Algo  ithms  5  and  6  is  convenient 
only  for  minimum  distance  decoding;  the  complexity  ia  the  same  as 
in  the  case  of  vector  quantization,  as  analyzed  in  the  following,  but  it 
does  not  compare  favorably  with  the  decoding  in  GF(2)  (see  Table  3). 

Quantization.  In  [3),  by  referring  to  a  definition  of  K (4)  as  .Vis,  it  is  shown 
that  the  nearest  neighbor  codeword  can  be  found  with  304  additions 
and  128  comparisons.  By  using  the  same  argument,  introduced  in  [3], 
based  on  a  variant  of  the  Fast  Hadamard  Transform  (FHT),  and  using 
out  definition  of  K{ 4),  we  will  show  that  288  additions  are  sufficient. 
The  proof  stems  on  the  following  observations,  motivated  in  [1,2,3], 

1.  The  HT  of  dimension  16  may  be  computed  by  evaluating  HT’s 
of  smaller  dimension.  The  matrix  Hu  may  always  be  written  as 

H4  Ht  H<  \ 

it  _  (  Ht  H»  \  _  H4  -Hi  Hi  -Hi 

16  V  Ht  -Ht  )  Hi  Hi  -Hi  -Hi  ’ 

Hi  -Hi  -Hi  Hi  J 

bom  the  above  observations  it  follows  that  a  HT  of  dimension 
16  may  be  computed  by  performing  two  HT’s  of  dimension  8  and 
operating  16  sums,  and  every  HT  of  dimension  8  may  by  obtained 
from  two  HT’s  of  dimension  4  and  8  sums.  The  FHT  of  dimension 
4  requires  8  additions. 
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2.  It  is  direct  to  verify  that  the  HT’s  of  a  4-dimensional  vector  a  with 
an  even  number  of  components  having  the  sign  changed,  msy  be 
obtained  from  the  HT  of  a  by  simple  permutations  and  changes  of 
signs.  In  fact  let  aT  ~  (01,01,03,04)  be  a  4-dimensional  vector 
whose  HT  is  the  vector  bT  =  (61,67,63,64)  given  by  6  =  H«o, 
where  the  matrix  has  the  structure 


{  1  1  1  1\ 


V  i  -1  -1  1/ 


If  all  four  Oj’s  change  the  sign  then  the  four  6,’s  do  the  Bame. 
If  only  two  0,  ’s  change  the  sign  then  the  four  6,’s  are  permuted 
and/or  changed  of  sign  according  to  the  following  correspondence 


(-01,-ai,  03,  04) 
(  01,  at,  -os,  -04) 
(-01,  ai,-a$,  04) 
(  01, -oi,  03,-04) 
(-01,  01,  03,-04) 
(  “i, -01,-os,  04) 


(-63,-64,-61,-61) 
(  63,  64,  61,  61) 
(-61,-61,-64,-63) 
(  61,  61,  64,  6s) 
(-64,  -6j,  -bt,  -6j) 

(  64,  6s,  6i,  6j). 


3.  By  permuting  some  columns  in  the  matrix  (5)  we  get 


(  0000  0000 
0000  0001 

t  0000  0011 

0000  0101 
0000  0011 
0000  0101 
0000  0111 

1  V  0000  0110 


0000  0000  \ 
1010  1101 
1111  0000 
1100  1010 
0001  1011 
0101  0101 
0010  0110 
1111  1111  / 


(11) 


4.  Due  to  the  particular  partition  of  the  vectors  wt,  as  shown  by 
the  rows  of  matrix  (11),  many  of  the  HT’s  of  dimension  4  need 
not  to  be  computed  8  times.  In  fact  if  both  vectors  x  and  to,-  are 
partitioned  into  four  parts  of  the  same  size 


(»i  I  *»  I  *s  I  *4) 
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and 

(«i.  |  »»i  |  |  »4 0, 

their  dot  product  may  be  performed  independently  in  each  single 
part 

*©».'  =  (*1  0  »li  |  *3  ©  « *2%  |  *S  ©  Wj,  |  *4  ©  »4<)- 

Therefore,  the  above  observation,  together  with  the  property  re¬ 
ported  in  the  previous  item  2,  yields  the  conclusion  that  only  the 
HT’s  of  the  following  seven  vectors  must  be  computed 

*i  and  0(1,1, 1,-1) 

*s  and  *s  ©  (1,1, 1,-1) 

*4  and  *4  0(1,1, -1.1). 

5.  In  combining  the  HT’s  pertaining  to  *j  and  Z4  to  get  the  HT’s 
of  dimension  8,  only  5  out  of  8  combinations  are  necessary.  This 
comes  form  the  fact  that  two  vector  sums  are  to  be  performed 

*U  =*10»»  +  *4©»4i 


MU  =  *S  O  ~  *4  ©  »4.- 

In  computing  *u  and  *}«>  for  i  =  1,. . .  ,8,  the  cases  «  =  3,6,8  do 
not  need  any  addition,  since 

*13  = 

*33  =  -*11 
*1S  =  -*n 
*33  =  -*21 

*ie  is  a  permutation  of  -  *u 
*23  is  a  permutation  of  —  zjj. 
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ESI 

real  sums 

E00EE3 

memory 

3540 

4096 

4096  bit 

255 

- 

144 

4352  bit 

- 

- 

140 

12312  bit 

- 

- 

16 

528384  bit 

- 

5 

288 

- 

384  bit 

128 

6 

288 

16 

512  bit 

128 

Table  3:  Complexity  figures  for  decoding  K  (4) 


6.  In  conclusion  the  total  number  of  additions  is 

128  =  8  x  16;  number  of  sums  required  for  combining 
8  pairs  of  HT’s  of  dimension  8; 

56  =  7  x  8;  number  of  sums  required  for  the  evalu¬ 
ation  of  seven  HT’s  of  dimension  4; 

64  =8x8;  number  of  sums  required  for  combining 
the  HT’s  pertaining  to  >i  and  *2 
40  =  5  x  8;  number  of  sums  required  for  combining 
the  HT’s  pertaining  to  xs  and  *4 

2138 


5  Bit  Error  Probability 

In  general  it  is  very  hard  to  compute  either  bit  error  rate  or  word  error 
rate  for  nonlinear  codes.  For  K  (4),  however,  such  a  computation  is  feasible 
because  its  structure  is  very  similar  to  that  of  linear  codes.  As  previously 
observed,  the  decoding  can  be  organized  as  a  Standard  Array,  since  the 
translates  of  K  (4)  by  the  correctable  error  patterns  do  not  overlap  and  cover 
the  whole  vector  space  of  dimension  16  over  GF(2). 

The  polynomial  expression  of  the  bit  error  probability  p»  of  K  (4)  after 
complete  decoding  on  the  BSC,  in  terms  of  p,  raw  bit  error  rate  of  the  BSC, 
has  the  form 

1  18 

P»  =  g  ]C  %  P‘- 

0  1=0 

where  the  coefficients  B,  are  reported  in  Thble  4.  They  have  been  computed 
from  the  Standard  Array  according  to  a  counting  scheme  proposed  in  [1]. 
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1 

ML 

MC 

0 

0 

0 

1 

0 

0 

2 

0 

0 

3 

1464 

1329 

4 

-12635 

-10856 

5 

52116 

40497 

6 

-130242 

-81309 

7 

211196 

65510 

8 

-218250 

110310 

9 

117176 

-409012 

10 

22836 

675936 

11 

-99288 

-704544 

12 

88496 

496376 

13 

-43680 

-233184 

14 

12480 

66912 

15 

-1664 

-8960 

16 

0 

0 

Table  4:  Coefficient*  for  BER  computation  of  If  (4). 


Interesting  are  the  asymptotic  expressions  p*  x  ^p*p3  and  p4  x  l^pp* 
for  the  ML  and  MC  decoding  respectively,  as  p  tends  to  zero.  From  these 
relations  it  follows  that,  at  least  asymptotically,  the  MC  rule  is  superior 
to  the  ML  rule.  These  theoretical  results  have  been  checked  by  simulation 
using  the  TOPSIM  package  [7], 


6  Conclusions 

In  this  paper  we  have  dealt  with  many  different  properties  of  Kerdock  codes. 
Disparate  results  have  been  put  together  because  they  represents  the  man¬ 
ifold  aspects  of  this  very  interesting  class  of  nonlinear  codes. 

This  final  Section  summarizes  the  many  results  that  were  obtained.  First 
of  all  a  description  of  Kerdock  codes  that  allows  instantaneous  encoding  is 
given.  This  approach  leads  to  the  application  of  two  different  decoding 
strategies,  i.e.  the  well  known  Maximum  Likelihood  criterion  and  another 
one  that  we  have  called  Minimum  Correction  rule.  Referring  to  K( 4)  it 
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has  been  shown  that  a  Standard  Array  can  be  built  by  translating  the  set 
of  codewords  without  overlapping.  From  the  inspection  of  this  Standard 
Array  it  turns  out  that  K(4)  is  not  quasi- perfect  (see  also  Table  2).  The 
same  Standard  Array  allows  the  computation  of  the  bit  error  rate  for  K  (4) 
on  the  binary  symmetric  channel,  with  respect  to  both  ML  and  MC  decoding 
strategies:  in  this  particular  case  MC  is  asymptotically  superior. 

Finally  it  has  been  shown  that  a  vector  quantization  scheme  based  on 
K  (4)  can  be  performed  with  only  18  suras  and  8  comparisons  per  sample. 
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